SYSTEM DESCRIPTION


SYSTEM  TITLE:  Conversion  Programs  for  SDI 


OBJECTIVES  OF  THE  SYSTEM: 

1.  Reformat  and  convert  the  MEDLARS  Medical  Subject 
Heading  (MESH)  Dictionary  to  BCD  mode,  even  parity. 

2.  Reformat  and  convert  the  Retrieved  Citation  File 
(RCF)  to  BCD  mode,  even  parity. 

3.  Transfer  the  Converted  MESH  Tape  and  Converted 
RCF  File  from  tape  to  disk.  Print  the  files 
when  requested.  Eliminate  duplicate  Main  Heading 
Codes  from  the  Converted  RCF  File. 

4.  Sort  the  Converted  MESH-Disk  file  into  sequence 
by  English  Main  Heading. 




PROGRAM ID: "MDTCONV"

TITLE: MEDLARS Medical Subject Heading (MESH)
Dictionary Conversion Program

PURPOSE OF PROGRAM: The Medical Subject Heading (MESH)
Dictionary portion of the MEDLARS
Dictionary Tape (MDT) is converted
from mixed mode (binary and BCD),
odd parity to BCD mode, even parity.


PROGRAM  DESCRIPTION 


PROGRAM  TITLE:  MEDLARS  Medical  Subject  Heading  (MESH) 
Dictionary  Conversion  Program 


OBJECTIVES  OF  THE  PROGRAM: 

The input to this COMPASS conversion program is
the MEDLARS Dictionary Tape (MDT) received periodically
(usually monthly) from the National Library of Medicine
(NLM), for use in the Selective Dissemination of
Information (SDI) system. The tape is produced on the NLM
Honeywell 800 computer. It is mixed mode (binary and
BCD), odd parity. It uses the Honeywell character set,
which differs slightly from the CDC character set. The
tape must be converted to BCD mode, even parity, and the
differing characters must be translated before the tape
can be used in the COBOL programs of SDI.

The  MDT  contains  six  separate  subfiles,  one  of 
which  is  the  Medical  Subject  Heading  (MESH)  Dictionary. 
The  other  files  are  not  used  in  the  SDI  system.  The 
MESH  subfile  is  not  separated  from  the  preceding  files 
by  an  end-of-file  indicator. 

The MESH input record is variable in length with
a maximum size of 2056 characters. However, only a
portion of the information from the complete MESH record is
needed. This portion is the "Main Heading Code" and its
corresponding "English Main Heading." The Main Heading
Code is a 17-bit binary number. It is converted to an
eight digit BCD number. The English Main Heading on the
MDT is BCD, so it is not necessary to convert it. Blanks,
however, are octal 15 in the Honeywell character set,
which is a different character in the CDC character set;
therefore, 15's must be translated to octal 60.
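The two conversions described above can be sketched in Python. This is an illustration only, not the COMPASS routine: the function names are hypothetical, the Honeywell blank is assumed to be octal 15, and "eight digit BCD number" is assumed to mean eight decimal digits with leading zeros.

```python
HONEYWELL_BLANK = 0o15  # assumed reading of the Honeywell blank code
CDC_BLANK = 0o60        # CDC blank, as stated in the text


def main_heading_code_to_digits(code: int) -> str:
    """Render a 17-bit Main Heading Code as eight decimal digits."""
    if not 0 <= code < (1 << 17):
        raise ValueError("Main Heading Code must fit in 17 bits")
    return f"{code:08d}"


def translate_blanks(chars: list[int]) -> list[int]:
    """Replace each Honeywell blank with the CDC blank; keep other codes."""
    return [CDC_BLANK if c == HONEYWELL_BLANK else c for c in chars]
```

The largest 17-bit value is 131071, so eight digits always leave at least two leading zeros.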

The Converted MESH Tape is used as input to the
Converted Tapes to Disk and Print program.

CVTBBCD, the binary to BCD conversion routine used
in this program, is from APPENDIX A, pp. A-45, A-46 of
the CDC COMPASS Programming Training Manual (Pub.
No. 60184200).

Appendix A of this documentation gives a detailed
description of the input MEDLARS Dictionary Tape.
Appendix A is a duplication of part of NLM's documentation of
the entire MDT. Reference is made to figures 5, 6 and 7.
Only figure 7, the format of the MESH sub-file, is included
in this documentation. Figure 7 follows Appendix A.
Figures 5 and 6 are not included as they relate to
subfiles other than the MESH.


APPENDIX  A 


MEDLARS Dictionary Tape (MDT)

The tape layout of the MDT conforms to many of
the same conventions described in the CCF. This tape,
however, contains six separate subfiles. Those subfiles
needed in conjunction with the CCF are the LANDS
(Language and Subheading) file, JRF (Journal Record File),
and the MeSH (Medical Subject Heading) file. These
subfiles are not separated by end of file records.

The file identification, word 2 in the second
record on tape, contains the word MEDICTΔΔ. The
subfiles immediately follow in the next records. They are
identified by sub-file identification of LANDS, JRFΔΔ,
or MeSHΔ in word 2 of the record. Within the LANDS
file (Figure 5) there is a LANDS record for each 6-bit
code possible. The code is duplicated 8 times within
one word. Spaces appear in the fields if the code is
not assigned. The subheading is 3 words or 24
characters. The language abbreviation is 5 characters.

The relative frequency of usage is found in bits
31-48 of word 7. Beginning in word 8 is a variable
length field containing the 8-bit representation of the
Language and the subheading.

The JRF record, represented in Fig. 6, contains
the JTC (Journal Title Code) in word 3 of the record.
The corresponding JTA (Journal Title Abbreviation) is
8 words in length beginning in word 9. This field may
contain any of the case codes and diacritics shown in
Fig. 4. The JRF normally would not be necessary to the
use of the CCF since the JTA is found in the citation
body. However, it does contain the complete list of
journals with their code assignments. All words after
word 16 pertain to publication information (frequency,
last issues received, etc.) and the 8-bit representation
of the Journal Title.


Description  of  the  MeSH  Record 

All records in the MeSH sub-file (Figure 7)
contain a banner word, two ortho-words and an end of
record word. The description of these can be found in
the CCF section.

The  MeSH  data  record  is  of  variable  length,  with 
a  maximum  size  of  254  words.  Each  record  contains  only 
one  item. 


Word by Word Description of the MeSH Record

Word 1:

A. Bits 1-30 = MeSHΔ. This is the sub-file
identification. It identifies the record as
a part of the MeSH Sub-file of the MEDLARS
Dictionary Tape.

B. Bits 31-32 = not used.

C. Bits 33-48 = Item Word Count. This is a
count in binary of the number of computer
words in the item. The EOI is included
in this count but the banner word is
excluded.

Word 2:

A. Bits 1-24 = Tally. This is a binary tally
of the number of times that the Medical
Subject Heading has been used in indexing.

B. Bits 25-41 = Main Heading Code. This is a
six digit octal number assigned to a
Medical Subject Heading (17 bits only).
This code corresponds to the order of the
English Main Headings on the file. (Where
the MHC and alternate MHC differ, the tally
will be found in that record which corresponds
to the order in which the records are placed
by a machine sort on English Main Headings.)

C. Bit 42 = not used.

D. Bits 43-48 = Type Code. This identifies
the transaction type. For a main heading,
the 6 bit type code is always 1.


Word  3: 


A. Bits 1-7 = Form Tag Code. This is a binary
code used to indicate that the form of
entry of citations in Index Medicus is not
the standard form of entry. The code for
Non-IM citations is used to exclude the
citation from Index Medicus. The codes in
binary are the following:

0000001 = Review
0000010 = Biographies
0000011 = Non-IM Citations
0000100 = Obituaries

B. Bits 8-12 = not used.

C. Bits 13-18 = Tag Override Code. This is a
binary code used to safeguard against
indexing or typing errors which would result in
a non-IM Medical Subject Heading being
designated as an IM Heading.

D. Bits 19-24 = not used.

E. Bits 25-41 = Alternate Main Heading Code.
This 6 digit octal number distinguishes those
main heading codes whose printing sequence
differs from the sequence in which they are
filed internally in the computer. This is
the code that is placed in the CCF records.

F. Bits 42-45 = not used.

G. Bit 46 = Bibliography of Medical Reviews
Indicator. This bit indicates that the
Main Heading has appeared in BMR.

H. Bit 47 = Cumulated Index Medicus Indicator.
This bit indicates that the Main Heading
will appear in CIM.

I. Bit 48 = Index Medicus Indicator. This bit
indicates that the Main Heading has appeared
in Index Medicus.

Word 4:

A. Bits 1-6 = Alternate Indicator. This entry
is required if the Medical Subject Heading
is out-of-sequence because of conflict between
library rules for listing and the computer
rules for listing. The indicator will appear
as the 6 bit code for an equal sign (13₈) or
the 6 bit code for an asterisk (54₈). The
MHC and alternate MHC will differ at the
*term, and *term will carry the tally.

B. Bits 7-18 = not used.

C. Bits 19-24 = Number of Tree Words. A main
heading may have up to, but not more than,
four tree words or classification numbers.
This entry gives the number of TREE words
for each main heading.

D. Bits 25-30 = Relative location of GRACE Message
#1. This entry gives the relative location
within the record of the GRACE message without
categories.

E. Bits 31-36 = Relative location of the
Recurring Bibliography Numbers. This entry gives
the relative location within the record of
the Recurring Bibliography Numbers.

F. Bits 37-40 = Relative location of GRACE
Message #2. This entry gives the relative
location within the record of the GRACE
message with categories.

Words  5-10: 

These  six  words  are  allowed  for  the  6  bit  English 
Main  Heading  with  trailing  spaces  (48  characters).  This 
field  will  not  contain  any  upper  or  lower  case  indicators 
or  any  special  characters  other  than  the  hyphen,  comma, 
parentheses  or  apostrophe. 

Words 11-12:

These two words are used to indicate those
subheadings which may be used with this main heading. Bits
1-48 of word 11 are used for subheading codes 0 through
47. Bits 1-16 of word 12 represent the subheadings whose
codes are 48-63. These bits are used as follows. If a
one appears in the appropriate bit position then the
subheading with the corresponding code may be used with the
main heading. If a zero appears, then usage of the
corresponding subheading is illegal in conjunction with
this main heading.
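The permission test for a subheading can be sketched as a bit lookup across words 11 and 12. The sketch assumes 48-bit words held as Python integers and assumes that bit 1 is the leftmost (most significant) bit, which the text does not state; the function names are hypothetical.

```python
WORD_BITS = 48  # one computer word on the MDT


def bit(word: int, position: int) -> int:
    """Return bit `position` of a word, counting bit 1 as the leftmost."""
    return (word >> (WORD_BITS - position)) & 1


def subheading_allowed(word11: int, word12: int, code: int) -> bool:
    """True if subheading `code` (0-63) may be used with the main heading."""
    if not 0 <= code <= 63:
        raise ValueError("subheading code must be in the range 0-63")
    if code <= 47:
        return bit(word11, code + 1) == 1  # codes 0-47: bits 1-48 of word 11
    return bit(word12, code - 47) == 1     # codes 48-63: bits 1-16 of word 12
```

With this numbering, subheading code 0 is governed by bit 1 of word 11 and code 63 by bit 16 of word 12.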
Word 13:

A. Bits 1-36 = Six Digit Code. This is the
Main Heading code in 6 bit format. This
information is no longer used.

B. Bits 37-48 = Not Used.


Tag Word:

A. Bits 1-24 = Not Used.

B. Bits 25-27 = Tag Group. This entry defines
the type of main heading. The 3 bit codes
are as follows:

0 = Standard Main Heading (IM and non-IM)
1 = Geographic Headings (Non-IM only)
2 = Public Health Service Headings (Non-IM only)
3 = Provisional Headings (Non-IM only)
4 = Form Tag or Type of Article (Non-IM only)

C. Bits 28-48 = Tree Word or Classification
Number. If the heading has been assigned to
a category this 21-bit code represents the
position this main heading occupies in a
tree structure. The number is in the form
AXX.XXX.XX.X (A = alphabetic character and
X = decimal character). The first part of
the number defines the category to which the
tag belongs, and the other sections define
the level of the number. (See detailed
description of CCF record.) A main heading
may have up to 4 classification numbers, each
one occupying bits 28-48 of a separate word.
GRACE Message #1:


Bits 1-8 of the first word of this message give
in octal representation the number of GRACE lines used
in the message. Bits 9-16 give an octal count of the
8 bit characters in the first GRACE line. Bits 17-24
give an octal count of the 8 bit characters in the
second GRACE line, and so on, depending on the number of
GRACE lines in the message. Following this is the
Main Heading (without categories attached) in 8-bit
GRACE code.


GRACE Message #2:

Bits 1-8 of the first word of this message give
in octal the number of GRACE lines in the message. Bits
9-16 give an octal count of the 8 bit characters in the
first GRACE line. Bits 17-24 give an octal count of
the 8-bit characters in the second GRACE line, and so on,
depending on the number of GRACE lines in the message.
Following this is the Main Heading with categories
attached, in the 8 bit GRACE code.
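The header layout shared by both GRACE messages can be sketched as follows. The sketch assumes a 48-bit first word with bit 1 leftmost, and it handles only messages whose line counts all fit in that first word (at most five lines); how longer headers continue is not described here, so the function, whose name is hypothetical, rejects them.

```python
def grace_line_counts(first_word: int) -> list[int]:
    """Return the character count of each GRACE line from the first word."""
    n_lines = (first_word >> 40) & 0xFF  # bits 1-8: number of GRACE lines
    if n_lines > 5:
        raise ValueError("sketch only reads counts held in the first word")
    # Bits 9-16 hold the first count, bits 17-24 the second, and so on.
    return [(first_word >> (40 - 8 * i)) & 0xFF for i in range(1, n_lines + 1)]
```

For a message of two lines holding 12 and 7 characters, the function returns the list [12, 7].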


Recurring Bibliography Word:

This field contains a 6-bit bibliography number
and an 8-bit sub-bibliography number for every bibliography
for which the main heading qualifies. There are three
bibliography and sub-bibliography numbers per word. Immediately
following this variable field is the END OF ITEM WORD.
PROGRAM ID: "RCFCONV"

TITLE: Retrieved Citation File (RCF)
Conversion Program

PURPOSE OF PROGRAM: The Retrieved Citation File (RCF)
is converted from mixed mode (binary
and BCD), odd parity to BCD mode,
even parity.


PROGRAM  DESCRIPTION 


PROGRAM  TITLE:  Retrieved  Citation  File  (RCF)  Conversion 
Program 


OBJECTIVES  OF  THE  PROGRAM: 

The Retrieved Citation File (RCF) is used as
input to this COMPASS program. The file is produced
monthly on the National Library of Medicine (NLM)
Honeywell 800 computer. It is in mixed mode (binary and BCD),
odd parity and uses the Honeywell character set, which
differs from the CDC character set. The RCF tape must
be converted to BCD mode, even parity, and the differing
characters must be translated for use in the COBOL programs
of the Selective Dissemination of Information system.

The RCF input record is variable in length with a
maximum size of 2056 characters. The records which are
to be converted and translated are recognized by an "R"
in character 13 of the RCF record.

Only  certain  fields  within  the  Retrieved  Citation 
File  record  are  necessary  for  the  SDI  system.  The 
"Citation  Number,"  citation  body  (this  consists  of  the 
author,  title,  volumes,  pages,  publication  date  and  other 
bibliographic  information),  "Number  of  Tag  Words,"  and 
"Main  Heading  Codes"  (these  vary  according  to  the  number 
of  tag  words)  are  the  fields  used. 


The Citation Number is a 20-bit binary number
which is converted to an eight digit BCD number.
Likewise, the Number of Tag Words must be converted from a
4-bit binary number to a two digit BCD number, and the
Main Heading Codes, which are 17-bit binary numbers, must
be changed to eight digit BCD numbers. Each character of
the citation body has to be checked to determine if it
must be translated from the Honeywell octal value to the
CDC octal value. The Honeywell octal values that are
translated to a CDC octal value are shown in Table B-1.

The output converted RCF File is a fixed length
record although the number of Main Heading Codes varies
and the length of the citation body is not fixed. The
unused portions of the output record are blank filled to
ensure acceptable records for the Converted Tapes to Disk
and Print COBOL program.
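Blank filling a variable amount of data into a fixed length record can be sketched as below. The field order and the body width are hypothetical, since the actual output layout is not given in this section; only the eight digit and two digit field widths and the limit of 32 Main Heading Codes come from the document.

```python
MAX_CODES = 32     # maximum number of tag words, per Appendix B
CODE_DIGITS = 8    # each Main Heading Code becomes eight digits
BODY_WIDTH = 1000  # hypothetical width reserved for the citation body


def build_output_record(citation_number: int, codes: list[int], body: str) -> str:
    """Lay out one fixed length record, blank filling the unused portions."""
    if len(codes) > MAX_CODES:
        raise ValueError("too many Main Heading Codes")
    if len(body) > BODY_WIDTH:
        raise ValueError("citation body too long")
    code_field = "".join(f"{c:0{CODE_DIGITS}d}" for c in codes)
    return (
        f"{citation_number:08d}"                      # Citation Number
        f"{len(codes):02d}"                           # Number of Tag Words
        f"{code_field.ljust(MAX_CODES * CODE_DIGITS)}"  # codes, blank filled
        f"{body.ljust(BODY_WIDTH)}"                   # body, blank filled
    )
```

Every record produced this way has the same length regardless of how many codes or how much citation text it carries, which is the property a fixed length reader depends on.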

A maximum of 11,000 records are written on the
first output file. When this limit is reached another
scratch tape is required to process the remaining input
records.
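The record limit can be pictured as cutting the output into volumes. This sketch assumes that every output tape takes the same 11,000 record limit, which the text states only for the first file; the function name is hypothetical.

```python
from itertools import islice
from typing import Iterable, Iterator


def split_into_volumes(records: Iterable, limit: int = 11_000) -> Iterator[list]:
    """Yield successive lists of at most `limit` records, one per tape."""
    it = iter(records)
    while True:
        volume = list(islice(it, limit))
        if not volume:
            return
        yield volume
```

An input of 25,000 records would therefore need three tapes under this assumption.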

CVTBBCD, the binary to BCD conversion routine used
in this program, is from APPENDIX A, pp. A-45, A-46 of
the CDC COMPASS Programming Training Manual (Pub.
No. 60184200).

Appendix  B  gives  a  detailed  description  of  the 
input  Retrieved  Citation  File.  It  is  a  duplication  of 
NLM's  documentation  of  the  RCF.  The  Figure  1  referenced 
in  Appendix  B  immediately  follows  the  Appendix. 


APPENDIX B


Retrieved  Citations  File 

The  Retrieved  Citations  File  is  the  final  file  of 
the  Demand  Search  Module.  It  contains  the  requestor 
IDs  and  the  retrieved  citations. 


Description  of  the  RCF  Data  Record 

The  RCF  data  record  is  of  variable  length.  The 
maximum  record  length  is  254  words.  Each  citation  record 
contains  one  RCF  item  (detailed  in  Figure  1). 


Word  by  Word  Description  of  RCF  Item 

Word 1:

A. Bits 1-18 = Request Number. Binary
representation of the number of the request.

B. Bits 19-24 = Sub-Search Code. Alpha
representation of the Sub-search code word A, B, or C.

C. Bits 25-30 = R. Alpha representation
indicating that this is the Request Statement.

D. Bits 31-48 = Item Word Count. Binary count
of computer words in this item. This count
does not include the banner word, but does
include the EOI word.


Word 2:


A.  Bits  1-17  =  Place  of  publication.  Seventeen 
bit  binary  code  whose  English  equivalent  may 
be  found  in  the  MeSH  file  of  the  MDT  as  a 
main  heading  code. 

B.  Bits  18-29  =  Year  of  publication.  Binary 
representation  of  year. 

C.  Bits  30-32  =  not  used. 

D.  Bits  33-48  =  Item  Word  Count.  Binary  count 
of  computer  words  in  this  item  on  the  CCF 
file. 


Word  3: 

A.  Bits  1-20  =  Citation  number.  A  binary  number 
assigned  serially  to  each  citation. 

B. Bits 21-24 = Sub-search code. Binary
representation of the Sub-search code representing
A, B, or C for Report Generator.

C. Bits 25-42 = Journal Title code. Three
alphanumeric characters which represent the
equivalent journal name.

D. Bits 43-48 = Language code. Six bit binary
code whose English equivalent may be found
in the LANDS file of the MDT.


Word 4:


A.  Bits  1-7  =  Form  code.  Seven  bit  binary 
number  denoting  form.  At  present  these 
codes  represent: 

0  =  Standard 

1  =  Review 

2  =  Biography 

3  =  Famous  Persons 

4  =  Obituary 

B. Bits 8-12 = Subform code. The binary
codes represent:

0  =  Standard 

1  =  Anonymous 

2  =  Non-standard  pagination 

3  =  Anonymous  non-standard  pagination 

C. Bits 13-18 = Indexer Number.

D.  Bits  19-24  =  Typist  Number. 

E. Bits 25-37 = Date of Entry. Binary
representation of date citation entered the
system. The code is in the form:

Bits 25-28 = year, where 1962=0, 1963=1, etc.
Bits 29-32 = month
Bits 33-37 = day

F. Bits 38,39 = not used.

G.  Bits  40,41  =  Type  of  Author.  These  codes 
are  assigned  as  follows: 

0  =  Standard 

1  =  Compiler 

2  =  Editor 

3  =  Both  Compiler  and  Editor 

H.  Bit  42  =  Indicator.  This  bit  (1)  indicates 
a  journal  not  routinely  indexed. 

I.  Bits  43-48  =  Number  of  Tag  Words.  Binary 
count  of  words  which  immediately  follow  this 
one  and  which  contain  main  heading  codes. 

The  number  of  tag  words  is  variable. 
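Unpacking the fields of Word 4 can be sketched with a small bit-field helper. The sketch assumes 48-bit words held as Python integers with bit 1 leftmost, and reads the Date of Entry as bits 25-37 (year in 25-28, month in 29-32, day in 33-37); the function names are hypothetical.

```python
from datetime import date

WORD_BITS = 48


def field(word: int, first: int, last: int) -> int:
    """Extract bits first..last of a word, counting bit 1 as the leftmost."""
    width = last - first + 1
    return (word >> (WORD_BITS - last)) & ((1 << width) - 1)


def date_of_entry(word4: int) -> date:
    """Decode the Date of Entry; the year field counts from 1962 = 0."""
    return date(1962 + field(word4, 25, 28),
                field(word4, 29, 32),
                field(word4, 33, 37))


def number_of_tag_words(word4: int) -> int:
    """Bits 43-48: count of tag words that follow Word 4."""
    return field(word4, 43, 48)
```

The same `field` helper applies to any of the bit ranges listed in this appendix, for example `field(word3, 1, 20)` for the Citation Number.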


Word  5:  (Variable  number  of  these  words) 

A.  Bits  1-17  =  Main  Heading  Code.  This  17  bit 
code  may  be  found  on  the  MeSH  file  of  the 
MDT  with  its  English  equivalent. 

B. Bits 18-23 = Subheading code. Six bit binary
code representing the Subheading. This code
and its English equivalent may be found in
the LANDS file of the MDT. (Not used at
present.)

C. Bit 24 = IM Indicator. 0 = a non-IM tag;
1 = IM. The setting indicates whether or not
this citation will appear in INDEX MEDICUS
under this main heading.

D. Bits 25-27 = Tag Group. A 3 bit code
representing the tag group. These are as follows:

0  =  Standard  main  heading 

1  =  Geographic  main  heading 

2  =  Public  Health  support  main  heading 

3  =  Provisional  main  heading 

4  =  Type  of  article 

E. Bits 28-48 = Classification Number. A 21 bit
code representing the position this main
heading occupies in a tree structure, if the
heading has been assigned to a category.
The classification number is in the form:
AXX.XXX.XX.X where A = alphabetic character
and X = decimal character. The first
part of the number defines the category to
which the tag belongs. The other sections
define the level of the number. Within the
21 bits the number is broken into a 7.7.6.1
allocation. The first 7 bits represent the
category assigned to this code. The next
seven bits represent the number assigned at
the second level; the next 6 bits indicate
a third level term; the last bit indicates
the tag is a fourth level term. These
sections are zero if not used.


A main heading may have a variable number of
classification numbers up to 4. Each
number assigned to the main heading occupies
a new word. For each separate subheading
assigned with a main heading a new word is
created. The maximum number of tag words
is 32.

The classification number is located on the
MeSH file with its main heading.
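The 7.7.6.1 allocation of the 21-bit Classification Number can be sketched as a set of shifts and masks. The sketch takes the 21 bits as already extracted from bits 28-48 of a tag word and returns the four sections as plain integers; how those integers map to the printed AXX.XXX.XX.X form is not specified here, so no such mapping is attempted. The function name is hypothetical.

```python
def split_classification(number: int) -> tuple[int, int, int, int]:
    """Split a 21-bit classification number into its 7.7.6.1 sections."""
    if not 0 <= number < (1 << 21):
        raise ValueError("classification number must fit in 21 bits")
    category = (number >> 14) & 0x7F      # first 7 bits: category
    second_level = (number >> 7) & 0x7F   # next 7 bits: second level number
    third_level = (number >> 1) & 0x3F    # next 6 bits: third level term
    fourth_level = number & 1             # last bit: fourth level indicator
    return category, second_level, third_level, fourth_level
```

A section that is not used comes back as zero, matching the statement that unused sections are zero.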


Citation Body: (For further detail, consult the
Description of the CCF Item)

The  remainder  of  the  words  in  the  item  comprise 
the  citation  body.  This  information  is  in  6-bit  code 
packed  8  to  a  word.  The  fields  are  separated  by  special 
code  configurations. 

1. Author. There are two types of author
names. The sort author is preceded by a
left-justified (73₈). The print name
appears only if the name does not conform
to the pattern under sort name. An octal
77 left-justified in the word marks the
end of the authors or indicates an anonymous
author.

2. Title. Field is a variable number of
characters beginning with the 77₈ and ending with
37₈. This field contains the title of the
article in English with all code
possibilities detailed in Figure 2.

3. Journal Title Abbreviation. Field is
enclosed by 37₈ and 56₈.

4. Volume. This field begins with 56₈ and
ends with 57₈. It denotes the volume number
of the journal.

5. Pagination. This field represents the page
numbers of the article to which the citation
refers. It is enclosed by a 57₈ and a 76₈.

6. Publication Date. The date of publication
is enclosed by a 76₈ and 36₈.

7. Vernacular. This field is the vernacular
version of the foreign language title, if
the article is foreign. It ends with a
53₈.

8. References. This field contains the number
of references made to other material if the
citation is a review article. The field is
enclosed by a 53₈ and a 52₈. It is not
necessary for each field to be present. If
a field is missing only the begin and end
symbols are present.

9.  Issue  Number.  Right  justified  twelve  bits 
of  the  EOI  word  are  used  to  express  the 
issue  number. 
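Since each field of the citation body ends where the next begins, the body can be cut by looking for the closing codes in order. The sketch below assumes the body has been unpacked into a list of 6-bit codes, reads the volume delimiters as octal 56 and 57, and assumes a delimiter code never occurs inside a field; none of these points is guaranteed by the description above, and the names are hypothetical.

```python
# Closing code of each field, in the order the fields are listed above.
FIELD_ENDS = [
    ("authors", 0o77),
    ("title", 0o37),
    ("journal_title_abbreviation", 0o56),
    ("volume", 0o57),
    ("pagination", 0o76),
    ("publication_date", 0o36),
    ("vernacular", 0o53),
    ("references", 0o52),
]


def split_citation_body(codes: list[int]) -> dict[str, list[int]]:
    """Cut a citation body into fields at the successive closing codes."""
    fields: dict[str, list[int]] = {}
    start = 0
    for name, end_code in FIELD_ENDS:
        try:
            end = codes.index(end_code, start)
        except ValueError:
            break  # remaining delimiters absent; stop rather than guess
        fields[name] = codes[start:end]
        start = end + 1
    return fields
```

A missing field shows up as an empty list between its two delimiters. The author field keeps any octal 73 marker that precedes a sort author; stripping it is left to the caller.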
CHARACTER SET TRANSLATION TABLE
FOR NLM'S RETRIEVED CITATION FILE TAPE

[Table B-1. Columns: HONEYWELL CHARACTER (OCTAL VALUE) and
CDC CHARACTER (OCTAL VALUE). The Honeywell octal values are
not legible. Legible entries: the Honeywell blank and one
other character translate to a CDC blank (octal 60); three
characters translate to CDC octal 37, 16 and 14; four
control characters are ignored; ten citation body item
separators translate to a CDC blank (octal 60).]

TABLE B-1


PROGRAM  ID:  "CVTTODAP" 

TITLE:  Converted  Tapes  to  Disk  and  Print 

PURPOSE  OF  PROGRAM:  The  Converted  MESH  Tape  and  the 

Converted  RCF  File  are  transferred 
from  tape  to  disk  with  the  options 
to  print  none  of  the  tapes,  either 
tape  or  both  tapes. 


PROGRAM  DESCRIPTION 


PROGRAM  TITLE:  Converted  Tapes  to  Disk  and  Print 

OBJECTIVES OF THE PROGRAM:

The Converted MESH Tape and the Converted RCF
File are the input tapes to this COBOL program. The
records of both files are written to disk in the same
format, but blocked for use in the Selective
Dissemination of Information system.

The  option  to  print  the  files  exists.  The 
character  typed  in  response  to  the  console  message 
determines  the  printed  output  as  follows: 

0  -  Neither  Tape  Printed 

1  -  Converted  MESH  Tape  Printed 

2 - Converted RCF File Printed (First 100 Records)

3  -  Both  Tapes  Printed 

When  the  Converted  RCF  File  is  transferred  to 
disk,  a  check  is  made  to  determine  if  duplicate  "Main 
Heading  Codes"  exist.  All  duplicate  codes  are  eliminated 
and  the  "Number  of  Main  Heading  Codes"  is  adjusted 
accordingly.  Only  the  first  one-hundred  (100)  records 
of  the  Converted  RCF  File  are  printed  when  an  option  to 
print  the  tape  is  given. 
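The duplicate check can be sketched as follows. The sketch keeps the first occurrence of each Main Heading Code in its original order and returns the adjusted count alongside the list; whether the COBOL program preserves order is not stated, so that choice is an assumption, and the function name is hypothetical.

```python
def remove_duplicate_codes(codes: list[int]) -> tuple[list[int], int]:
    """Drop repeated Main Heading Codes and return the adjusted count."""
    seen: set[int] = set()
    unique: list[int] = []
    for code in codes:
        if code not in seen:
            seen.add(code)
            unique.append(code)
    return unique, len(unique)
```

The second value returned is what would be stored as the "Number of Main Heading Codes" for the record written to disk.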

The Converted MESH Tape is transferred record
by record to disk. The complete tape is printed under
options "1" and "3."


PROGRAM ID: "MESHSORT"

TITLE: Sort Converted MESH-Disk File by
English Main Heading

PURPOSE OF PROGRAM: This sort puts the Converted MESH-
Disk file in ascending order by
standard BCD collating sequence.
The sort key is the English Main
Heading (positions 01-48).
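The sort key can be illustrated with a short sketch in which each record is a string whose first 48 positions hold the English Main Heading. Python's default string ordering stands in for the BCD collating sequence here; the two orderings are not the same, so this shows only how the key is selected, and the function name is hypothetical.

```python
KEY_LENGTH = 48  # English Main Heading occupies positions 01-48


def sort_by_main_heading(records: list[str]) -> list[str]:
    """Return records in ascending order of their first 48 positions."""
    return sorted(records, key=lambda record: record[:KEY_LENGTH])
```

Because `sorted` is stable, records with identical headings keep their original relative order.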


3150 OPERATING INSTRUCTIONS

MAX. TIME: 20
RUN ID: SD11
RUN NAME: MEDLARS Medical Subject Heading
(MESH) Dictionary Conversion Program
COST CENTER: 5730
WORK CODE: 000

CARDS, DISK, PRINTER, PAPER TAPE, HALTS and
OPERATOR COMMENTS: no legible entries.

SMUFD FORM 2006 (Test)
(Aug 68)

3150 OPERATING INSTRUCTIONS

MAX. TIME: 35
RUN ID: SD12
RUN NAME: Retrieved Citation File (RCF)
Conversion Program
COST CENTER: 5730
WORK CODE: 000
STEP: 2

MAG. TAPE:
UNIT 1, LUN 1, LABEL: Retrieved Citation File
UNIT 2, LUN 2

SPECIAL INSTRUCTIONS: When console message requests NEW tape,
dismount LUN 2 and SAVE. Mount new scratch tape, ready and
press GO.

CARDS, DISK, PRINTER, PAPER TAPE and HALTS: no legible entries.

3150 OPERATING INSTRUCTIONS

MAX. TIME: 90
RUN ID: SD13
RUN NAME: Converted Tapes to Disk and Print
COST CENTER: 5730
WORK CODE: 000
STEP: 3

MAG. TAPE:
UNIT 1, LUN 1, LABEL: Converted RCF File
UNIT 2, LUN 2

DISK: PACK NO. 1999 tracks, 200 tracks

PRINTER: FORM NO. STD, PARTS 1, CARRIAGE TAPE STD

SPECIAL INSTRUCTIONS: Type in number when requested as follows:

0 - Neither tape to be printed
1 - Converted MESH Tape to be printed (LUN 2)
2 - Converted RCF File to be printed (LUN 1)
3 - Both Tapes to be printed

HALTS:

1. CAUSE: The Physical Limits of the CONVRCFDISK File Have
Been Exceeded. You Must Allocate More Space and Rerun the
Program. The Program Stops.
CORRECTIVE ACTION: Reallocate and Rerun.

2. CAUSE: The Physical Limits of the CONVMDTDISK File Have
Been Exceeded. You Must Allocate More Space and Rerun the
Program.
CORRECTIVE ACTION: Reallocate and Rerun.

OPERATOR COMMENTS

3150 OPERATING INSTRUCTIONS

MAX. TIME: [illegible]
RUN ID: SD14
RUN NAME: Sort Converted MESH-Disk
File by English Main Heading
COST CENTER: 5730
WORK CODE: 000
STEP: 4

MAG. TAPE, CARDS, DISK, PRINTER, PAPER TAPE and
SPECIAL INSTRUCTIONS: no legible entries.

[Program flowcharts: hand-lettered pages, not legible in this copy.]
DATA PROCESSING STORAGE FORMAT

MEDIA: Tape/Disk
TITLE: [illegible]
FILE NAME:
RECORD LENGTH:
BLOCKING FACTOR:


UNCLASSIFIED
Security Classification

DOCUMENT CONTROL DATA - R & D

(Security classification of title, body of abstract and indexing
annotation must be entered when the overall report is classified)

1. ORIGINATING ACTIVITY (Corporate author):
Wolf Research and Development Corporation

2a. REPORT SECURITY CLASSIFICATION: UNCLASSIFIED

2b. GROUP: Not Applicable

3. REPORT TITLE

Selective Dissemination of Information System - VOLUME II
(Conversion Programs for SDI)